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ABSTRACT. Almost from the inception of Hilbert's program, founda- 
tional and structural efforts in proof theory have been directed towards 
the goal of clarifying the computational content of modern mathemati- 
cal methods. This essay surveys various methods of extracting compu- 
tational information from proofs in classical first-order arithmetic, and 
reflects on some of the relationships between them. Variants of the Godel- 
Gentzen double-negation translation, some not so well known, serve to 
provide canonical and efficient computational interpretations of that the- 
ory. 

1 Introduction 

Hilbert's program was launched, in 1922, with the specific goal of demon- 
strating the consistency of modern, set-theoretic methods, using only finitary 
means. But the program can be viewed more broadly as a response to the 
radical methodological changes that had been introduced to mathematics in 
the late nineteenth century. Central to these changes was a shift in mathemat- 
ical thought whereby the goal of mathematics was no longer viewed as that of 
developing powerful methods of calculation, but, rather, that of characterizing 
abstract, possibly infinite, mathematical structures, often in ways that could 
not easily be reconciled with a computational understanding. 

Grisha's work over the years has touched on almost every aspect of proof 
theory, both of the reductive (foundational) and structural sort, involving a 
wide range of logical frameworks. But much of his work addresses the core 
proof-theoretic concern just raised, and has served to provide us with a deep 
and satisfying understanding of the computational content of nonconstructive, 
infinitary reasoning. Such work includes his characterization of the provably 
total computable functions of ISi as exactly the primitive recursive functions 
([28, 29]); his method of continuous cut elimination, which provides a finitary 
interpretation of infinitary cut-elimination methods ([32, 11]); and his work on 
the epsilon substitution method (for example, [35, 37]). 

Grisha has also been a friend and mentor to me throughout my career. 
The characterization of the provably total computable functions of ISi just 
mentioned was, in fact, also discovered by Charles Parsons and Gaisi Takeuti, 
all independently. I shudder to recall that at a meeting at Oberwolfach in 1998, 
when I was just two- and- a- half years out of graduate school, I referred to the 
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result as "Parsons' theorem" in a talk before an audience that, unfortunately, 
included only the other two. Grisha asked the first question after the talk was 
over, and nothing in his manner or tone even hinted that I had made a faux pas 
(it didn't even occur to me until much later). In fact, I still vividly remember 
his encouraging and insightful comments, then and in later discussion. (For 
the record, Gaisi was equally gracious and supportive.) 

In this essay, I will discuss methods of interpreting classical first-order arith- 
metic, often called Peano arithmetic (PA), in computational terms. Although 
the study of PA was central to Hilbert's program, it may initially seem like a 
toy theory, or an artificially simple case study. After all, mathematics deals 
with much more than the natural numbers, and there is a lot more to math- 
ematical argumentation than the principle of induction. But experience has 
shown that the simplicity of the theory is deceptive: via direct interpretation 
or more elaborate forms of proof-theoretic reduction, vast portions of mathe- 
matical reasoning can be understood in terms of PA [4, 13, 43]. 

Here, I will be concerned with the n 2 , or "computational," consequences of 
PA. Suppose PA proves Vx 3y R(x,y), where x and y range over the natural 
numbers and R(x, y) is a decidable (say, primitive recursive) relation. We would 
like to understand how and to what extent such a proof provides an algorithm 
for producing such a y from a given x, one that is more informative than blind 
search. There are four methods that are commonly used to extract such an 
algorithm: 

1. Godcl's Dialectica interpretation [18, 6], in conjunction with a double- 
negation interpretation that interprets PA in its intuitionistic counter- 
part, Heyting arithmetic (HA) 

2. realizability [22, 23, 27, 45], again in conjunction with a double-negation 
translation, and either the Friedman A-translation [14] (often also at- 
tributed to Dragalin and Leivant, independently) or a method due to 
Coquand and Hofmann ([12, 1]) to "repair" translated II2 assertions 

3. cut elimination ([15]; see, for example, [41]) 

4. the epsilon substitution method ([19, 8]) 

These four approaches really come in two pairs: the Dialectica interpretation 
and realizability have much in common, and, indeed, Paulo Oliva [39] has 
recently shown that one can interpolate a range of methods between the two; 
and, similarly, cut elimination and the epsilon substitution method have a lot 
in common, as work by Grisha (e.g. [36]) shows. That is not to say that 
there aren't significant differences between the methods in each pairing, but 
the differences between the two pairs are much more pronounced. 

For one thing, they produce two distinct sorts of "algorithms." The Dialec- 
tica interpretation, and Kreisel's "modified" version of Kleene's realizability, 
extract terms in Godel's calculus of primitive recursive functionals of finite 
type, denoted PR" in Section 2 below. In contrast, cut elimination and the 
epsilon substitution method provide iterative procedures, whose termination 
can be proved by ordinal analysis. Specifically, one assigns (a notation for) an 
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ordinal less than £o to each stage of the computation in such a way that the or- 
dinals decrease as the computation proceeds. Terms in PR" and ^£ -recursive 
algorithms both have computational meaning, and there are various ways to 
"see" that the computations terminate; but, of course, any means of proving 
termination formally for all such terms and algorithms has to go beyond the 
means of PA. 

Second, as indicated above, the first two methods involve an intermediate 
translation to HA, while the second two do not. It is true that the Dialectica 
interpretation and realiz ability can be applied to classical calculi directly (see 
[42] for the Dialectica interpretation, and, for example, [2, 38] for realizability); 
but I know of no such interpretation that cannot be understood in terms of a 
passage through intuitionistic arithmetic [2, 5, 44]. In contrast, cut elimination 
and the epsilon substitution method apply to classical logic directly. That is 
not to deny that one can apply cut elimination methods to intuitionistic logic 
(see, for example, [46]); but the arguments tend to be easier and more natural 
in the classical setting. 

Finally, there is the issue of canonicity. Algorithms extracted from proofs in 
intuitionistic arithmetic tend to produce canonical witnesses to U 2 assertions; 
work by Grisha [33, 34] shows, for example, that algorithms extracted by var- 
ious methods yield the same results. In contrast, different ways of extracting 
witnesses from classical proofs yield different results, conveying the impression 
that there is something "nondetcrministic" about classical logic. (There is a 
very nice discussion of this in [47, 48]. See also the discussion in Section 6 
below.) Insofar as one has a natural translation from classical arithmetic to 
intuitionistic arithmetic, some of the canonicity of the associated computation 
is transferred to the former theory. 

In this essay, I will discuss realizability and the Dialectica interpretation, as 
they apply to classical arithmetic, via translations to intuitionistic arithmetic. 
After reviewing some preliminaries in Section 2, I will discuss variations of 
the double-negation interpretation in Section 3. One, in particular, is very 
efficient when it comes to introducing negations; in Section 4, I will show that, 
when combined with realizability or the Dialectica interpretation, this yields 
computational interpretations of classical arithmetic that are efficient in their 
use of higher types. In Section 5, 1 will consider another curious double-negation 
interpretation, and diagnose an unfortunate aspect of its behavior. 

This work has been partially supported by NSF grant DMS-0700174 and 
a grant from the John Templeton Foundation. I am grateful to Philipp Ger- 
hardy, Thomas Streicher, and an anonymous referee for helpful comments and 
corrections. 

2 Preliminaries 

Somewhat imprecisely, one can think of intuitionistic logic as classical logic 
without the law of the excluded middle; and one can think of minimal logic 
as intuitionistic logic without the rule ex falso sequitur quodlibet, that is, from 
_L conclude anything. Computational interpretations of classical logic often 
pass through minimal logic, which has the nicest computational interpretation. 
(One can interpret intuitionistic logic in minimal logic by replacing every atomic 
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formula A by A V _L, so the difference between these two is small.) 

To have a uniform basic to compare the different logics, it is useful to take 
the first-order logical symbols to be V, 3, A, V, — >, and J_, with -up defined to be 
p — > _L. However, when it comes to classical logic, it is often natural to restrict 
one's attention to formulas in negation-normal form, where formulas are built 
up from atomic and negated atomic formulas using A, V, V, and 3. A negation 
operator, ~<£, can be defined for such formulas; ~<£ is what you get if, in ip, 
you exchange A with V, V with 3, and atomic formulas with their negations. 
Note that ~~</? is just ip. Classically, every formula <p has a negation-normal 
form equivalent, <p nnf , obtained by defining (6 — s- n) nnf to be ^9 nnt V ?7 nnf , 
and treating the other connectives in the obvious way. This has the slightly 
awkward consequence that (^p) nnf translates to ^p nnf V _L, but simplifying 
6 V _L to 8 and A _L to _L easily remedies this. 

There are a number of reasons why negation-normal form is so natural for 
classical logic. First of all, it is easy to keep track of polarities: if <p is in 
negation-normal form, then every subformula is a positive subformula, except 
for, perhaps, atomic formulas; an atomic formula A occurs positively in ip if it 
occurs un-negated, and negatively if it occurs with a negation sign before it. 
Second, the representation accords well with practice: any classically-minded 
mathematician would not hesitate to prove "if <p then ip" by assuming ->ip and 
deriving -up, or by assuming ip and ^tp and deriving a contradiction; so it is 
convenient that ip — > ip, -tip — > -up, and -i(<p A -tip) have the same negation- 
normal form representation. Finally, proof systems for formulas in negation- 
normal form tend to be remarkably simple (see, for example, [41, 46]). 

It was Godel [18] who first showed that the provably total computable func- 
tions of arithmetic can be characterized in terms of the primitive recursive 
functionals of finite type (see [6, 20]). The set of finite types can be defined 
to be the smallest set containing the symbol N, and closed under an operation 
which takes types cr and r to a new type a — > r. In the intended ("full") 
interpretation, N denotes the set of natural numbers, and a — > r denotes the 
set of all functions from a to r. A set of terms, PR", is defined inductively as 
follows: 

1. For each type a, there is a stock of variables x, y, z, . . . of type a. 

2. is a term of type N. 

3. S (successor) is a term of type N — > N. 

4. if s is a term of type r —> a and t is a term of type r, then s(t) is a term 
of type a. 

5. if s is a term of type a and a; is a variable of type r, then Ax s is a term 
of type t — y (7. 

6. If s is a term of type a, and t is a term of type N — > (a — > a), then R st 
is a term of type N — > a. 

Intuitively, s(t) denotes the result of applying s to t, Xx s denotes the function 
which takes any value of x to s, and R st denotes the function defined from s 
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and t by primitive recursion, with R s t(0) = s and R s t(S(x)) — t(x, R st (x)) for 
every x. In this last equation, I have adopted the convention of writing t(r, s) 
instead of (t(r))(s). 

It will be convenient below to augment the finite types with products a x t, 
associated pairing operations (•, •), and projections (-)n and (-)i. Product types 
can be eliminated in the usual way by currying and replacing terms t with 
sequences of terms t%. It will also be convenient to have disjoint union types 
a + T, an element of which is either an element of a or an element of r, tagged 
to indicate which is the case. That is, for each such type we have insertion 
operations, inl and inr, which convert elements of type a and r respectively 
to an element of type a + r; predicates isleft(a) and isright(a), which indicate 
whether a is tagged to be of type a or r; and functions left(o) and right(a), 
which interpret a as an element of type a and r, respectively. References to 
such sum types can be eliminated by taking a + r to be N x a x r, defining 
inl(a) = (0, a, T ), defining inr(a) = (l,0 CT ,a), where CT and T are constant 
zero functionals of type a, t respectively, and so on. 

In the next section, I will describe various double-negation interpretations 
that serve to reduce classical arithmetic, PA, to intuitionistic arithmetic, HA 
— in fact, to HA taken over minimal logic. These show that if PA proves a 1I2 
formula Va; By R(x, y), then HA proves Va: — ■ — R(x, y); in fact, a variant HA' 
of HA based on minimal logic suffices. This reduces the problem to extracting 
computational information from the latter proof. 

One method of doing so involves using Kreisel's notion of modified realiz- 
ability, combined with the Friedman A-translation. One can extend HA to a 
higher-type version, HA U , which has variables ranging over arbitrary types, 
and terms of all the primitive recursive functionals. Fix any primitive recur- 
sive relation A(y); then to each formula ip(x) in the language of arithmetic, one 
inductively assigns a formula "a realizes ip(x)," as follows. 

a realizes _L = A(a) 

a realizes 9 = 6, if 9 is atomic 

a realizes ipAip = ((a)o realizes ip) A ((0)1 realizes ip) 

a realizes <pVip = ((isleft(a) A left(a) realizes ip) V 

(isright(a) A right(a) realizes ip j) 

a realizes ip —> ip = Mb (b realizes ip — > a(b) realizes ip) 

a realizes \/x ip(x) = \/x (a(x) realizes ip(x)) 

a realizes 3x <p>(x) = (a)i realizes ip((a)o) 

Now, suppose classical arithmetic proves Va; By R(x,y), for some primitive 
recursive relation R. Then, using a double-negation translation, HA' proves 
Va; ^~>By R(x,y), and hence it proves -i—3y R(c,y) for a fresh constant c. Fix 
A(y) in the realizability relation above to be the formula R{c,y). Inductively, 
one can then extract from the proof of term t of PR" such that HA" proves 
that t realizes — 1— i3y R(x, y). Now notice that the identity function, id, realizes 
-By R(c,y), since a realizer to By R(c,y) is simply a value of a satisfying 
R(c,a). Thus if a realizes By R(c,y), then a(id) satisfies R(c,a(id)). Viewing 
a, now, as a function of c, yields the following conclusions: 
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THEOREM 1. If classical arithmetic proves \/x -iSy R(x,y), there is a term 
F o/Pr of type N to N such that HA^ proves Vx R(x,F(x)). 

See [45, 24] for more about realizability, and [14] for the A-translation. 

Godel's Dialectica interpretation provides an alternative route to this result. 
In fact, one obtains a stronger conclusion, namely that the correctness of the 
witnessing term can be proved in a quantifier- free fragment PR^ of HA" . To 
each formula ip in the language of arithmetic, one inductively assigns a formula 
ip D of the form 3x Vy ipn{x,y), where x and y are now tuples of variables of 
appropriate types. Assuming ip D = 3u W tpo(u,v), the assignment is defined 
as follows: 



6 D = 6, if 9 is atomic 

(pAtp) D = 3x, u Vy, v (ip D Aip D ) 

(pVip) D = 3z My,v (islcft(z) A <p D (left (z),y) V 

(isright(z) A %p D (right (z), v))) 

(p^^) D = 3U,YVx,v(<p D (x,Y(x,v))^ip D (U(x),v)) 

(Vz<p(z)) D = 3XVz,y<p D (X(z),y,z) 

(3z p(z)) D = 3z,x\/y (p D (x,y,z) 



The clause for implication is the most interesting among these, and can be 
understood as follows: from a witness, x, to the hypothesis, U(x) is supposed 
to return a witness to the conclusion; and given a purported counterexample, 
v, to the conclusion, Y(x, v) is supposed to return a counterexample to the 
hypothesis. Since we have defined -up to be ip — > _L, notice that (-^•p) D is 
3YMx -^<p D (x,Y(x)). 

The Dialectica interpretation of Vx — >3y R(x,y) is 3Y \/x -*^R(x,Y(x)), 
which is intuitionistically equivalent to 3Y Vx R(x, Y(x)), given the decidabil- 
ity of primitive recursive relations. One can show that from a proof of <p in 
HA, one can extract a term F such that for every x, PR^ proves <p_d(x, F(x)), 
once again yielding Theorem 1. 



3 Some double-negation translations 

We have seen that one can use modified realizability or the Dialectica inter- 
pretation to extract an algorithm from a proof of a II2 statement in classical 
arithmetic, modulo a method of reducing classical arithmetic to intuitionistic 
arithmetic. Double negation translations provide the latter. 

A formula is said to be negative if it does not involve 3 or V and each atomic 
formula A occurs in the form ->A; in other words, the formula is built up from 
negated atomic formula using V, A, — >, and _L. Over minimal logic, negative 
formulas are stable under double negation, which is to say, if ip is any negative 
formula, then HA proves that -i-«p is equivalent to p (see, for example, [46]). 

The Godel-Gentzen double-negation translation maps an arbitrary first- 
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order formula ip to a negative formula, ip N : 



± N 


= _L 




= if 9 is atomic 




= p N A ip N 




= ^p N A^ N ) 




= y N -> 


(Vx <p) N 


= Vz^ 


(3x p) N 


= -Nx^fF 



The translation has the following properties: 
THEOREM 2. For any formula ip and set of sentences V: 

1. Classical logic proves ip «-> ^ 

2. // is provable from V in classical logic, then ip N is provable from T N in 
minimal logic. 

Since the HA proves the - N translations of its own axioms, we have as a corol- 
lary: 

COROLLARY 3. If PA proves <p, then HA proves <p N . 
In fact, one can strengthen the corollary in three ways: 

1. Since HA proves -i-i0 — > 9 for atomic formulas 9, one can define 9 N to be 
9. 

2. Assuming the language of HA includes, say, symbols denoting the primi- 
tive recursive functions, every negated atomic formula, ->6, has an atomic 
equivalent, 0; so one can define (^9) N to be 9. 

3. The theorem remains true if one replaces HA by a suitable variant, HA', 
based on minimal logic. 

These considerations hold in the theorems that follow as well. 

The reason to be concerned about negations is that they are undesirable 
with respect to the two computational interpretations given in the last section, 
since they lead to the use of more complicated types in the resulting terms 
of PR". There is a variant of the double- negation translation known as the 
Kuroda translation that fares slightly better in this regard: for any formula ip, 
let p Ku denote the result of doubly-negating atomic formulas, and adding a 
double negation after each universal quantifier, and, finally, adding a double- 
negation to the front of the formula. Then we have: 

THEOREM 4. For every formula p, p Ku o ip N is provable in minimal logic. 
Hence PA proves <p if and only if HA proves ip Ku . 

Note that intuitionistic logic, rather than minimal logic, is required in the 
conclusion. 
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Late in 2005, Grisha asked whether a version of the Dialectica interpretation 
designed by Shoenfield [42], for classical arithmetic, could be understood as 
a composition of the usual Dialectica interpretation together with a double- 
negation translation. I set the question aside and solved it a few months later 
[5], only to find that Ulrich Kohlenbach and Thomas Streicher had solved it 
more quickly [44] . In a way that can be made precise, the Shoenfield translation 
corresponds to the following version of the double-negation interpretation (itself 
a variant of a translation due to Krivine), expressed for a basis involving the 
connectives -■, A, V, and V. We define tp Kr to be -^ipxr, where pxr is defined 
recursively by clauses below. It helps to keep in mind that <pxr is supposed to 
represent the negation of ip: 



0Kr 


= -id, if 6 is atomic 


{-"P)kt 


= -iip Kr 


(ip A ip) Kr 


= tf Kr V IpKr 


(ip V ip) Kr 


= tfKr A IpKr 


(Vx ip) Kr 


= 3x ip Kr 



Note that we can eliminate either V or A and retain a complete set of connec- 
tives, but including them both is more efficient. Formulas of the form 3x ip, 
however, have to be expressed as -iVa; -up to apply the translation. 

THEOREM 5. For every formula ip, <p Kr «-> ip N is provable in minimal logic. 
Hence PA proves ip if and only if HA' proves p Kr . 

The -^-translation is particularly good when it comes to formulas in negation- 
normal form; it only adds two quantifiers for each existential quantifier, as well 
as one at the beginning. But one can do even better [2]. Taking advantage of 
the classical negation operator, now p M is defined to be ->(~v?)m, where the 
map ip n> ipM is defined recursively as follows: 



0m 


= 6, if 6 is atomic or negated atomic 


(ip V V)m 


= ip M V V>m 


(ip A V)m 


= ip M A ip M 


(3a; p) M 


= 3x ip M 


(Vx p) M 


= ->3x (~<^)m- 



Once again, we have 

THEOREM 6. For every formula <p in negation-normal form, p M o p N is 
provable in minimal logic. Hence PA proves p if and only if HA' proves p M . 

The - M -translation is extremely efficient with respect to negations, introduc- 
ing, roughly, one at the beginning of the formula, and one for every quantifier 
alternation after an initial block of universal quantifiers. Alternatively, can 
define (<p AiP)m in analogy to (Vx p)m, as ->((~<£)m V (^iP)m)- This gives the 
translation the nice property that given the formulas p M and (^p) M , one is 
the negation of the other. But there is a lot to be said for keeping negations 
to a minimum. 
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Of course, the • -translation extends to all classical formulas by identifying 
them with their canonical negation-normal form equivalents. Since the transla- 
tion relies on the negation-normal form representation of classical formulas, it 
shares many nice properties with a more complicated double-negation transla- 
tion due to Girard [17]. It is this translation that I will use, in the next section, 
to provide an efficient computational interpretation of classical arithmetic. 

Let me close with one more translation, found in [3], which is interesting in 
its own right. For reasons that will become clear later on, I will call it "the 
awkward translation": if tp is any formula in negation- normal form, let tp awk 
denote -i(~< / j). 

THEOREM 7. For any formula tp, tp N — > p awk is provable in minimal logic. 
Hence, PA proves tp if and only if HA' proves p awk . 

Proof. Once can show by induction that if tp is any formula in negation-normal 
form, then tp — > tp N is provable in minimal logic. So minimal logic proves that 
~ip implies (~<^) JV , and hence that ~'(~<^) JV implies p awk . But since ~<p is 
classically equivalent to -up, ^(^p) N is equivalent to -i-k/? jv , which is implied 
byp N . ' ■ 

The - au *-translation is almost absurdly efficient with respect to negations: 
the one classical negation on the inside adds no negations at all (recall that in 
arithmetic, negated atomic formulas have atomic equivalents), and the trans- 
lation adds only one negation on the outside. But the attentive reader will 
have noticed that the first assertion in Theorem 7 is slightly weaker than the 
corresponding assertions in the the theorems that precede it: only one direction 
of the equivalence is minimally valid. We will see, in Section 5, that this means 
that the translation fares very poorly with respect to modus ponens, making 
it impossible to translate ordinary proofs piece by piece. 

For pure first-order logic, an alternative proof of Theorem 7 can be found 
in [3] . Bcnno van den Berg (personal communication) later hit upon this same 
translation, independently. In [3], I claimed that with intuitionistic logic in 
place of minimal logic, the result is a consequence of a characterization of 
Glivenko formulas due to Orevkov, described in a very nice survey [30, Section 
3.2.5] of Russian proof theory by Grisha. 1 That seems to be incorrect; but van 
den Berg and Streicher have pointed out to me that in that case the result 
follows a theorem due to Mints and Orevkov [30, page 404, paragraph 4]. 

4 Interpreting classical arithmetic 

We now obtain direct computational interpretations of classical arithmetic sim- 
ply by combining the —translation of Section 3 with the computational inter- 
pretations of HA' given in Section 2. One annoying consequence of the use of 
the classical negation operator in the ^-translation is that it is impossible to 
carry out the translation of a formula tp from the inside out: depending on the 

1 There are typographical errors on page 401 of that paper, which Grisha has corrected for 
me. The last class eight lines from the bottom of the page should be {— ¥~ , ~ — , V+, the 
last class seven lines from the bottom of the page should be {— >~ , , V+, — > + , V+}; and 
the first class at the bottom line should be {— >+, ~+, V~}. 
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context in which a subformula ip occurs, the computational interpretation of 
the full formula may depend on either the computational interpretation of ip or 
the computational interpretation of In practice, then, it is often more con- 
venient to carry out the interpretation in two steps, applying the - M -translation 
first, and then one of the two computational interpretations described in Sec- 
tion 2. Nonetheless, it is interesting to see what happens when the steps are 
composed, which is what I will do here. Both translations apply to formulas 
in negation-normal form, and we can assume that negated atomic subformulas 
are replaced by their atomic equivalents. 

As in Section 2, the appropriate version of classical realizability is defined 
relative to a fixed primitive recursive predicate A(y). Most of the clauses look 
just like ordinary modified realizability: 

a realizes 9 = 9, if is atomic 

a realizes (pAip = ((a)o realizes <p) A ((a)i realizes <p) 
a realizes <pV ip = ((isleft(a) A left (a) realizes ip) V 

(isright(a) A right(a) realizes ip)) 
a realizes Bx ip(x) = (a)i realizes p((a) ) 

The only slightly more complicated clause is the one for the universal quantifier. 
Take a refutes p to be the formula V6 (b realizes ip — >• A(a(b))). 

a realizes Vx ip(x) = a refutes 3x ^p(x) 

One can then straightforwardly extract, from any proof of a formula p in 
classical arithmetic, a term a that refutes ~y. But now notice that the identity 
function realizes Vx A{x), where A is the negation of A; so, from a proof of 
By A(y) in classical arithmetic, since the identity function realizes Vx A(x), one 
obtains a term a satisfying A{a). This provides a direct proof of Theorem 1. 
Details can be found in [2]. A more elaborate realizability relation, based on 
the A-translation, can be found in [9]. 

The corresponding variant of the Dialcctica translation is similarly straight- 
forward. As with the Shocnfield variant [42, 5], each formula tp is mapped to a 
formula p D of the form Mx 3y pw (x, y), where x and y are sequences of vari- 
ables. Assuming ip D is Vu 3v ipo'{u,v), the translation is defined recursively, 
as follows: 

9 = 9, if 9 is atomic 

(p A ip) D = Vx, u By, v (ip D i (x, y) A ip D i (u, v)) 
(p V ip) D = Vx, u By, v (p D , (x, y) V ip D > (u, v)) 
(Vz p) D = Vz, x By p D , (x, y) 

This time, it is the clause for the existential quantifier that is slightly more 
complicated. If {^p(z)) D is Vr Bs {^p)o'{z,r,s), define 

{Bzpf = VSBz,r^p) D ,(z,r,S(z,r)). 
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On can interpret this as saying that for any function S(z, r) that purports to 
witness Mz,r 3s {~ip)D'{z,r,s), there are a z and an r denying that claim. 
Once again, one can straightforwardly extract, from any proof of a formula 
if in classical arithmetic, a term a satisfying Vx <fD'(x, a{x)). This provides 
another direct proof of Theorem 1. 

5 Back to the awkward translation 

I would now like to come back to the "awkward translation," discussed at the 
end of Section 3. I will do this via what at first might seem to be a digression 
through Kreisel's no- counterexample interpretation. Let if be a formula in 
prenex form, for example, 

3a; Vy 3z Vw 6(x, y, z, w). 

The Herbrand normal form f H of if is obtained by replacing the universally 
quantified variables of ip by function symbols that depend on the preceding 
existential variables, to obtain 

3x, z 9(x,f(x),z,g(x,z)). 

It is not hard to check that, in classical logic, if implies p H . Thus, by a slight 
variant of Theorem 1 (relativizing it to function symbols) , if classical arithmetic 
proves if, there will be terms F\(f,g) and F 2 (f,g) of PR" such that HA" proves 

yf,g9(F 1 (f,g),f(F 1 (f,g)),F 2 (f,g),g(F 1 (f,g),F 2 (f,g))). 

Think of / and g as providing purported counterexamples to the truth of if, so 
that Fi and F 2 effectively foil such counterexamples. The no-counterexample 
interpretation is simply the generalization of this transformation to arbitrary 
prenex formulas. 

One need not invoke Herbrand normal form to arrive at the previous con- 
clusion. One can check that if if is a prenex formula, the no-counterexample 
interpretation of ip is essentially just the Diabetica interpretation of ip awk , so 
the result follows from Theorem 7 as well. 

The no-counterexample interpretation can be viewed as a computational in- 
terpretation of arithmetic. But, in a remarkable article [25], Kohlenbach has 
shown that it is not a very modular computational interpretation, in the sense 
that it does not have nice behavior with respect to modus ponens. To make this 
claim precise, note that the set of (terms denoting) primitive recursive function- 
als, PR", can be stratified into increasing subsets PR", in such a way that any 
finite fragment of HA has a Dialectica interpretation (or modified realizability 
interpretation) using only terms in that set. Kohlenbach [25, Proposition 2.2] 
shows: 

THEOREM 8. For every n there are sentences if and %p of arithmetic such 
that: 

1. if is prenex. 

2. ip is a n 2 sentence, that is, of the form Vx 3y R(x, y) for some primitive 
recursive relation R. 
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3. Primitive recursive arithmetic proves tp. 
4- PA proves p — > tp. 

5. tp and every prenexation of tp ^ tp has a no- counterexample interpretation 
with junctionals in PRq . 

But: 

6. There is no term F o/PR^ which satisfies the no- counterexample inter- 
pretation oftp; that is, there is no term F such that Vx R(x, F(x)) is true 
in the standard model of arithmetic. 

Theorem 8 shows that there is no straightforward way to combine witnesses to 
the no-counterexample interpretations of tp and tp —> tp, respectively, to obtain 
a witness to the no-counterexample interpretation of tp. 

The problem with the awkward translation is that, similarly, it may behave 
poorly with respect to modus ponens. Consider pure first-order logic with a 
single predicate symbol, A(x). Then there are formulas tp and tp such that tp awk 
doesn't follow from (p awk and (<p — > %p) awk in minimal logic: just take tp to be 
the formula Vx A(x) and tp to be _L. In that case, <p awk A (tp -> tp) awk -> tp awk 
is equivalent, over minimal logic, to the double-negation shift, Vx -i->A(x) — > 
->-iA(x), which is not even provable in intuitionistic logic. 

When I showed the awkward translation to Grisha, he remarked right away 
that its behavior has something to do with Kohlenbach's result. At the time, I 
had no idea what he meant; but writing this paper finally prodded me to sort 
it out. Grisha was right: Theorem 8 can, in fact, be used to show that the awk- 
ward translation does not provide a modular translation of Peano arithmetic 
to Heyting arithmetic, in the following sense. 

THEOREM 9. For any fragment T of HA, there are formulas tp and tp such 
that the following hold: 

1. PA proves tp and p — > tp, but 

2. T together with tp awk and {tp ->■ %p) awk does not prove tp awk . 

This shows that modus ponens fails under the - awk -translation, in a strong way. 
Since PA proves tp and tp — > tp, it also proves tp, and so by Theorem 7, HA 
proves tp awk . But having the translation tp awk and (p —> tp) awk may not help 
much in obtaining such a proof of ip awk ; indeed, obtaining a proof of tp awk from 
tp awk and (<p — > tp) awk may be no easier than simply proving ip awk outright. 

Proof. Given T, first let n be large enough so that the Dialectica interpreta- 
tion of T uses only terms in PR^, and then let tp and tp be as in Theorem 8. 
If there were a proof of tp awk from p awk and (tp — > ip) awk in T, applying the 
Dialectica interpretation, one would obtain terms witnessing the Dialectica in- 
terpretation of tp awk from terms witnessing the Dialectica interpretations of 
tp awk and (tp — > tp) awk . But the Dialectica interpretation of tp awk is the no- 
counterexample interpretation of tp, and it is not hard to check that the Di- 
alectica interpretation of (tp — > tp) awk is the no-counterexample interpretation 
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of one of the prenexations of ip — > ip. Thus there would be a witness to the 
no-counterexample interpretation of ip in PR^, contrary to the choice of ip and 

ip. m 



6 Conclusions 

As noted in the introduction, conventional wisdom holds that classical logic is 
"nondetcrministic," in that different ways of extracting algorithms from clas- 
sical proofs can yield different results. Sometimes, however, nondeterminacy is 
unavoidable. For example, even minimal logic can prove 3x A(x) A 3y A(y) — > 
3z A(z), and any computational interpretation of this formula will have to 
choose either x or y to witness the conclusion. The difference is that this sen- 
tence is typically not taken to be an axiom of minimal logic; rather, there are 
two axioms, p A ip — > <P and p A ip — > ip, and any proof of the sentence has to 
choose one or the other. In contrast, standard calculi for classical logic provide 
cases where there are multiple choices of witnesses, with no principled reason 
to choose one over the other. For example, starting from canonical proofs of 
A(a) — > 3x A(x) and A(b) — > 3x A(x), one can weaken the conclusions to 
obtain proofs of <p — > (A(a) — > 3x A(x)) and -up — > (A(b) — > 3x A(x)), for an 
arbitrary formula, <p. Then, using the law of the excluded middle, <p V -up, one 
can combine these to obtain a proof of A(a) A A(b) — > 3x A{x) where there is 
little reason to favor a or b as the implicit witness to the existential quantifier. 
(This example is essentially that given by Lafont [16, p. 150].) 

This shows that standard classical calculi arc nondetcrministic in a way that 
proofs in intuitionistic and minimal logic are not. Girard [17] has neatly diag- 
nosed the source of the nondeterminacy, and has provided a calculus for clas- 
sical logic that eliminates it by forcing the prover to make an explicit choice 
in exactly those situations where an ambiguity would otherwise arise. 2 But 
note that the realizability interpretation of Section 4 also avoids this non- 
determinacy; the translation procedure described in [2] is fully explicit and 
unambiguous. With respect to the example discussed in the last paragraph, 
the interpretation chooses a or b based on the logical form of (p. 3 Indeed, the 
results of [2] show that the witnesses obtained in this way coincide with those 
obtained using a natural class of cut-elimination procedures. 

Yet another response to the example above is that of Urban and Bierman 
[47, 48], who simply embrace the nondctcrminism as an inherent part of the 
computational interpretation of classical logic. In fact, Urban [47] has provided 
a nondeterministic programming language to interpret classical logic in a nat- 
ural way. Passing through a double-negation interpretation, as we have done 
here, amounts to making specific choices to resolve the nondeterminism. It is 

2 One way to understand what is going on is to notice that in minimal logic there are two 
distinct ways of proving -1-1(95 A?/ 1 ) from -1-195 and — ■ — and this inference is needed to verify 
the classical axiom —1—1O — > 9 under the double-negation translation. Another way is to notice 
that since, in classical logic, 95 and -1-195 arc equivalent, there is little reason to favor 95 or 
-195 in situations where intuitionistic logic treats them differently. I am grateful to Thomas 
Strcicher for these insights. 

3 More generally, what breaks the symmetry alluded to in the previous footnote is that 95 
and -1-195 have the same negation-normal form, which is distinct from (and dual to) that of 
-.95. 
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an open-ended conceptual problem to understand which deterministic instances 
of the general nondeterministic algorithms can be realized in such a way. 

At this point, however, we should be clearer as to the goals of our analy- 
sis. Ordinary mathematical proofs are not written in formal languages, and so 
the process of extracting an algorithm from even a rather constructive math- 
ematical argument can involve nondeterminism of sorts. And, despite some 
interesting explorations in this direction [10], it is far from clear that classi- 
cal arithmetic can be used as an effective programming language in its own 
right. But formal methods are actively being developed in support of software 
verification [21], and a better understanding of the computational content of 
classical logic may support the development of better logical frameworks for 
that purpose [40]. Formal translations like the ones described here have also 
been effective in "proof mining," the practice of using logical methods to extract 
mathematically useful information from nonconstructive proofs [7, 24, 26]. 

Grisha's work has, primarily, addressed the general foundational question as 
to the computational content of classical methods. In that respect, the general 
metatheorems described here provide a satisfying answer: for the most part, 
classical mathematical reasoning does have computational content, which is 
to say, algorithms can be extracted from classical proofs; but by suppressing 
computational detail, the proofs often leave algorithmic detail underspecified, 
rendering them amenable to different implementations. Grisha's work has thus 
contributed to an understanding of the computational content of classical arith- 
metic that is mathematically and philosophically satisfying, providing a solid 
basis for further scientific research. 
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